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Nucleic Acid Molecule, comprising a Nucleic Acid Coding for a Polypeptide 
with Chorismate Mutase Activity 

The present invention concerns a nucleic acid molecule, comprising a nucleic acid coding for 
a polypeptide with chorismate mutase activity and derivatives thereof, wherein the 
derivatives show at least 10% of the chorismate mutase activity of the chorismate mutase 
according to SEQ ID NO: 2. The invention fiirther concerns vectors containing nucleic acid 
molecules, host cells containing nucleic acid molecules and processes for the production of 
polypeptides with chorismate mutase activity. The present invention fiirther concerns the 
polypeptides with chorismate mutase activity and antibodies which specifically recognise 
these. In addition, the invention concerns processes for the production of auxotrophic yeast 
strains by means of the nucleic acid molecules and the use thereof in processes for the 
recombinant expression of heterologous genes. 

As single-cell, eukaryotic microorganisms, yeasts can be cultured without difficulty and have 
the great advantage of being readily genetically manipulable. Also, they can process and 
modify recombinant proteins in accordance with the patterns known from higher organisms. 
As far as is yet known, they contain no pathogenic substances and are therefore also suitable 
for the production of therapeutic proteins. Thus for example the first vaccine produced by 
heterologous gene expression, the hepatitis B vaccine, was heterologously expressed in the 
well-characterised baker's yeast Saccharomyces cerevisiae (Lepetic et al, 1996). 

While S. cerevisiae makes it possible to produce many different proteins (for a review, see 
Gellissen and Hollenberg, 1997), there are nonetheless also a few limiting properties. Thus 
for example the maximal content of heterologous protein is about 1-5% of the total protein 
content of the cell (Buckholz and Gleeson, 1991). 

Various other yeasts, such as for example Schizosaccharomyces pombe, Kluyveromyces 
lactis, Yarrowia Upolytica and also Hamemila polymorpha have been characterised and 
compared with S. cerevisiae in terms of their suitability as expression systems (Miiller et ai., 
1998). All these yeasts showed markedly stronger secretion of active protein than 5. 
cerevisiae, the heterologous gene expression being dependent on the particular gene, but 
independent of the donor organism. Basically, methylotrophic yeasts are found to be 
attractive organisms for the production of recombinant proteins. The methylotrophic yeasts 
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are subdivided into the four genera Hansenula, Pichia, Candida and Torulopsis, which all 
possess the ability to utilise methane, methylamine, formaldehyde or formate as carbon and 
energy source. 

Inter alia, the yeast Hansenula polymorpha from the Saccharomycetaceae family (Lodder, 
1970) belongs to the relatively small group of the methylotrophic yeasts. Hansenula 
polymorpM does not ferment glucose under aerobic conditions, and is thus Crabtree-negative 
(Verduyn et al., 1992) and with a growth temperature optimum of 37*^0 is classed among the 
thermotolerant yeasts; thus it is an exception among the methylotrophic genera. The natural 
habitat of the methylotrophic yeasts is locations rich in organic material. 

Hitherto, only a few genes of the yeast Hansenula polymorpha had been cloned and charact- 
erised (Hansen and HoUenberg, 1996), and according to GenBank the number of cloned 
genes in Hansenula polymorpha amounts to > 30. However, how many of these are 
thoroughly characterised or are suitable as marker genes for plasmid selection, is unknown to 
me. Hence, I would suggest that similarities of these genes with the homologous genes in 
Saccharomyces cerevisiae, insofar as these are present, are relatively limited (Dobson et al., 
1982). The key enzymes in methylotrophic metabolism are the enzymes methanol oxidase 
(MOX), dihydroxyacetone synthase (DAS) and formate dehydrogenase (FMD), whose 
controlled expression by very strongly regulated promoters opens up a variety of useful 
possibilities for heterologous gene expression. These facts make Hansenula polymorpha an 
industrially interesting organism (Gellissen et al., 1994). 

So far, only two auxotrophic strains ofH. polymorpha are available, whose transformation 
can be selected by functional complementation of ura3- or leu2- deficiency respectively. The 
provision of further transformation systems, consisting of auxotrophic strains and of nucleic 
acids capable of complementing the auxotrophy hitherto failed because suitable genes 
capable of this complementation were not available. While a number of genes from the 
amino acid or nucleic acid biosynthesis pathway of the baker's yeast S. cerevisiae were 
known, it had nonetheless in the past been found that the differences between the genes of 
baker's yeast and the methylotrophic yeasts are sometimes considerable. Hence it cannot in 
general be expected that a S, cerevisiae gene will be suitable for complementation of an 



auxotrophy in a methylotrophic yeast, nor can it be expected that genes from a methylo- 
trophic yeast will be capable of complementing an auxotrophy in S. cerevisiae. 



Hence a technical problem underlying the present invention is to pro^de a new gene from H. 
polymorpha, which can serve as a selectable marker for complementation in the 
transformation of suitable auxotrophic yeast strains. A fiirther problem consists in the 
provision of vectors and host cells, which contain the gene. Further, it is to provide the 
polypeptide encoded by the gene and antibodies which specifically recognise the polypeptide. 
Finally, it is to provide suitable auxotrophic strains. 

This problem is solved according to the invention by a nucleic acid molecule comprising a 
nucleic acid coding for a polypeptide with chorismate mutase activity or the complementary 
strand thereof, wherein the nucleic acid is selected from 

(a) a nucleic acid with the DNA sequence stated in SEQ ID NO: 1 or the RNA sequence 
corresponding thereto; 

(b) a nucleic acid which hybridises with the complementary strand of a nucleic acid 
according to (a); 

(c) a nucleic acid which on the basis of the genetic code is degenerate to the DNA sequences 
defmed under (a) and (b); 

(d) a nucleic acid which hybridises with one of the nucleic acids stated in (a) to (c) and the 
complementary strand thereof codes for a polypeptide with chorismate mutase activity; 

(e) a nucleic acid which is at least 60% homologous to one of the nucleic acids stated in (a) 
to(d); 

(f) a variant of the nucleic acids stated in (a) to (e), wherein the variant had additions, 
deletions, insertions or inversions relative to the nucleic acids stated in (a) to (e); 

(g) a fragment of one of the nucleic acids stated in (a) to (f); 



(h) a combination of several of the nucleic acids stated in (a) to (g), 

wherein the polypeptide encoded by the nucleic acid or complementary strand thereof has at 
least 10% of the chorismate mutase activity of the chorismate mutase according to SEQ JD 
N0:2. 

In microorganisms and plants, the biosynthesis of aromatic amino acids proceeds firstly 
through 7 enzyme-catalysed reactions of the shikimate pathway from erythrose-4-phosphate 
and phosphoenol pyruvate to chorismate (figure 1). Chorismate is the substrate of the first 
branching point. In the baker's yeast Saccharomyces cerevisiae, starting from this branching 
point, on the one hand anthranilate is formed via the enzyme anthranilate synthase (E.C. 
4.1.3.27), and on the other prephenate via the enzyme chorismate mutase (E.C. 5.4.99.5). 
Finally, via fiirther intermediate products, tyrosine and phenylalanine are formed from 
prephenate (Braus, 199 1). In the baker's yeast Saccharomyces cerevisiae, the chorismate 
mutase is encoded by theAROJ gene (Schmidheini et al, 1989), which is located on 
chromosome XVI. AR07 encodes a 0,95 kb mRNA and contains a 771 bp open reading 
frame, which codes for a protein consisting of 256 amino acids. 

Below, a number of terms are explained in more detail, in order to make clear how they 
should be understood in connection with the present application. 

The term "chorismate mutase", as used below in the description, includes complete 
chorismate mutase, chorismate mutase fragments, chorismate mutase mutants and fusion 
proteins thereof Polypeptides which have at least 10% of the chorismate mutase activity of 
the chorismate mutase according to SEQ ID N0:2 are regarded as being according to the 
invention. 

"Chorismate mutase activity" means the catalytic conversion of chorismate to prephenate as 
part of the phenylalanine and tyrosine biosynthesis which is catalysed by the enzyme chor- 
ismate mutase [E.C. 5.4.99,5], The chorismate mutase activity of the polypeptide according 
to the invention can for example be measured spectrophotometrically through the acid- 
catalysed conversion of the product prephenate to phenylpyruvate, which absori)s at 320 nm 
(Schmidheini et al, 1989). 
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For its growth or proliferation, a "prototrophic'' imcroorganism needs only simple nutrients 
(carbon and nitrogen) and minerals, but can itself build up all needed amino acids. Hence it 
is capable of growing on "minimal medium". 

In contrast to this, "auxotrophic" microorganisms need additional factors, for example amino 
acids, which they cannot themselves synthesise owing to a defect in the biosynthesis pathway 
for the fector concerned. In connection with the present invention, the auxotrophy is 
preferably a phenylalanine/tyrosine auxotrophy, which is caused through diminished or 
absent chorismate mutase activity, e.g. of an auxotrophic yeast strain prepared according to 
the invention. 

"Minimal medium" means a nutrient solution, which contains only the components which are 
necessary for the growth of a prototrophic microorganism. In connection with the present 
invention, the minimal medium is preferably a phenylalanine- and/or tyrosine-free medium, 
through which the selection of the phenylalanine/tyrosine auxotrophic form of the micro- 
organism compared to the prototrophic form of the microorganism is made possible. 

"His" tag means a sequence of at least 6 histidine amino acids, which by appropriate cloning 
and fusion with an expressible sequence leads to a fiision protein with at least 6 His residues 
at the NH2 teminus, which can easily be purified by complexing with a Ni^"" column. 

A "heterologous gene" is understood to mean the coding region of a structural gene, which is 
either not expressed under the control of its own (homologous) promoter or not in the 
organism from which it is derived, or else is expressed neither under the control of its own 
promoter nor in the original organism. 

"Cloning" is intended to include all cloning methods known in the state of the technology, 
which could be used here, which are however not all described in detail, since they are among 
the obvious tools of the skilled person. 

"Recombinant expression in a suitable host cell" is to be understood to mean all expression 
methods known in the state of the technology in known expression systems, which could be 
used here, which are however not all described in detail, since they are among the obvious 
tools of the skilled person. 



By selection and sequencing of a genomic clone from Hcmsemla polymorpha which after 
transfonnation is capable of functionally complementing the phenylalanine/tyrosine auxo- 
trophy of the Saccharomyces cerevisiae aro7A-deletion strain, the inventors have for the first 
time succeeded in identifying a nucleic acid according to the invention, and producing 
auxotrophic mutants of methylotrophic yeasts with this nucleic acid. Thus a further urgently 
needed transformation system for methylotrophic yeasts is provided, which makes the 
targeted selection of transformed yeasts with simple media possible. 

The inventors have prepared a genonuc bank from K polymorpha by cloning of restriction 
fragments of chromosomal K polymorpha DNA into a shuttle vector, the said shuttle vector 
having a high copy number in yeast and the Hansenula genes being expressed in 5. cerevisiae 
under the control of their endogenous promoter. This genomic bank was used for the 
complementation of auxotrophic Saccharomyces cerevisiae strains, although it was not 
known each endogenous promoter of Hansenula polymorpha is active in S. cerevisiae. 

Surprisingly, it was now found that the chorismate mutase gene from Hansenula polymorpha 
is transcribed and translated in S, cerevisiae and is enzymatically active. 

For several reasons, it was not to be expected that the chorismate mutase gene in particular 
could be expressed in Saccharomyces cerevisiae. 

Thus it was not known whether the chorismate mutase gene from K polymorpha contains 
introns and whether the S, cerevisiae splicing apparatus is capable of the correct processing 
of the possible pre-RNA. From now on, it could be shown that the chorismate mutase gene 
from H. polymorpha has no introns. 

Further, it could not be foreseen that the translation product can be correctly folded and if 
necessary assembled in SI cerevisiae,, in order to give a sufficiently active chorismate mutase 
which is capable of catalysing the reaction of chorismate to prephenate. The absence of 
predictability is supported by the fact that HLEU2, the LEU2 homologue from H 
polymorpha,, is not capable of complementing a corresponding mutation in S. cerevisiae 
(Agaphonov et al., 1994). 



Further, it was not known that the biosynthesis of the aromatic amino acids tyrosine and 
phenylalanine in K pofymorpha involves the conversion of chorismate to prephenate with the 
aid of the enayme chorismate mutase at all. In fact it is known that there are considerable 
differences in the biosynthesis of aromatic amino acids. Thus in some cyanobacteria 
prephenate is first converted to arogenate by transamination. This pathway is also used by 
plants (Jensen and Stenmark, 1975) and is more widespread in nature than the hydroxy- 
phenylpyruvate/phenylpyruvate pathway, which is used by Saccharomyces cerevisiae and 
E. colt. 

The nucleic acid contained in the nucleic acid molecule according to the invention can be 
genomic DNA, cDNA or synthetic DNA, where a synthetic DNA sequence is also 
understood to mean one such which contains modified intemucleoside bonds. Furthermore 
the nucleic acid can be an RNA sequence, which e.g. can be necessary for expression by 
means of recombinant vector systems. The nucleic acid according to (b) is for example 
obtainable by use of a detectably labelled probe which corresponds to one of the sequences 
stated in (a) or a fragment or complementary strand thereof, for the screening of cDNA or 
genomic DNA libraries from microorganisms. Preferably the microorganism from which the 
bank is created belongs to a genus selected from: Pichia, Hamenula, Candida, Tomlopsis, 
Saccharomyces, Schizosaccharomyces, Kluyveromyces and Yarrowia. In particular, the 
microorganism belongs to a species selected from: Hansemla polymorpha, Saccharomyces 
cerevisiae, Schizosaccharomyces pombe, Kkfyveromyces lactis and Yarrowia iipofytica. In 
this, the identification of positive cDNA or genomic DNA clones is performed according to 
standard procedures. See Maniatis et al.. Molecular Cloning (1989), Cold Spring Harbor 
Laboratory Press. 

In a preferred embodiment, the hybridisation stated under (b) or (d) is performed under 
stringent conditions. Stringent hybridisation conditions are e.g. incubation at 65''C overnight 
in 7% SDS, 1% BSA, ImM EDTA, 250 mM sodium phosphate buffer (pH 7.2) and 
subsequent washing at 65°C with 2 x SSC; 0.1% SDS. 

In a preferred embodiment, nucleic acids are provided which are at least 60% homologous to 
the nucleic acid sequence stated in (a). Preferred are nucleic acids at least 80% homologous 
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to the nucleic acid sequence stated in (a). Especially preferred are nucleic acids at least 90% 
or 95% homologous to the nucleic acid sequence stated in (a). 



According to the invention, the expression "homology" means homology at the DNA level, 
which can be determined by known procedures, e.g. computer-aided sequence comparisons 
(Basic local alignment search tool, S. R Altschul et al., I Mol, Biol. 215 (1990), 403-410). 

The expression "homology", well-known to the skilled person, designates the degree of 
relatedness between two or more nucleic acid molecules, which is determined by the agree- 
ment between the sequences. The "percentage homology" is obtained from the percentage of 
identical regions in two or more sequences taking account of gaps or other sequence features. 

The homology of mutually related nucleic acid molecules can be determined by means of 
known procedures. As a rule, special computer programs with the algorithms taking account 
of the special requirements are used. 

Preferred procedures for the determination of homology firstly generate the greatest agree- 
ment between the sequences studied. Computer programs for determination of homology 
include, but are not limited to, the GCG program package, including GAP (Devereux, J., et 
al, Nucleic Acids Research 12 (12): 387 (1984); Genetics Computer Group University of 
Wisconsin, Madison, (WI)); and BLASTP, BLASTN and FASTA (Altschul S., et al., J. Mol. 
BioL, 215: 403-410 (1990)). The BLASTX program can be obtained from the National 
Centre for Biotechnology Information (NCBI) and from other sources (BLAST Handbook, 
Altschul S., et al., NCB NLM NIH Bethesda MD 20894; Altschul S., et al, J. Mol Biol, 
215: 403-410 (1990)). The well-known Smith-Waterman algorithm can also be used for the 
determination of homologies. 



Preferred parameters for the nucleic acid sequence comparison include the following: 



Algorithm: Needleman and Wunsch, J. Mol Biol. 48: 443-453 (1970) 

Comparison matrix: Matches = -f 10 

Mismatches = 0 
Gap penalty 50 
Gap length penalty: 3 



The GAP program is also suitable for use with the above parameters. The above parameters 
are the default parameters for nucleic acid sequence comparisons. 

Further examples of algorithms, gap opening penalties, gap extension penalties and 
comparison matrices including those named in the program handbook, Wisconsin package. 
Version 9, September 1997, can be used. The choice will depend on the comparison to be 
performed and further on whether the comparison is performed between sequence pairs, 
when GAP or Best Fit are preferred, or between one sequence and a large sequence database, 
when FASTA or BLAST are preferred. 

An agreement of 60% determined with the aforesaid algorithm is described in the context of 
the present application as 60% homology. The same applies for higher degrees of homology. 

In a preferred embodiment, the nucleic acid according to the invention is a combination of 
several of the nucleic acids stated under (a) to (f), which can be obtained by the fusion and if 
necessary cloning known to the skilled person. These combinations can e.g. be of especial 
interest for the creation of immunogenic constructs. 

In a further preferred embodiment, the polypeptide encoded by the nucleic acid according to 
the invention has at least 50% of the chorismate mutase activity of the chorismate mutase 
SEQ ID N0:2. It is especially preferred that the polypeptide has at least 75% of the chorism- 
ate mutase activity according to SEQ ID N0:2. Here, as already stated, the chorismate 
mutase activity is measured after Schmidheini et al., 1989. 

In a fiirther embodiment, the nucleic acid molecule comprises a promoter suitable for its 
expression, the nucleic acid being under the control of the promoter. The choice of the 
promoter depends on the expression system used for expression, in particular on the choice of 
host organism. Preferred according to the invention are both constitutive promoters such as 
PGK- and GSPDH- promoters and also inducible promoters such as GAL4, ADH2 and the 
Cu-metallothionein promoter (survey in Recombinant Gene Expression Protocols in Methods 
in Molecular Biology Vol.63, Ed. R S Tuan, Chapter IE; Schena, M. et al, (1991) Vectors for 
constitutive and inducible expression in yeast. Methods in Enzymol. 194, 389-398). Further, 
the use of mammalian glucocorticoid response elements (GRE) for increasing transcription is 
also taken into account by the invention (Schena et al.. Science 241 (1988), 965-967). 
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Also preferred are the promoters of the methanol metabolism of methylotrophic yeasts, in 
particular Hansenula polymorpha. The genes for the methanol metabolism enzjones in 
Hcmsemla polymorpha are among the most strongly expressed and reflated genes that have 
so far been described in yeasts (Van d^ Klei et al., 1991). The corresponding proteins can 
comprise up to 30% of the total protein content of the cell (Janowicz et al., 1985; Ledeboer et 
al, 1985). The expression of the methanol metabolism genes can on the one hand be induced 
by methanol, on the other hand however, the presence of glycerol also leads to derepression. 
Thus in this way the strong promoters of the methanol metabolism can also be used for 
heterologous gene expression under methanol-free culture conditions. 

Especially preferred promoters are the methanol oxidase A/OX-promoter described in //. 
polymorpha^ which at about 1.5 kb is unusually large and is among the strongest yeast 
promoters so far described. The presence of glucose leads to repression of the MOX- 
promoter, however the activity of this promoter can be increased more than 1000-fold by 
glycerol or methanol (Godecke et al, 1994). Further especially preferred is the dihydroxy- 
acetone synthase D^5-promoter (Ledeboer et al,, 1985) and the FA4D-promoter (European 
Patent 299108). 

In a further preferred embodiment, the nucleic acid molecule also comprises a signal peptide- 
coding nucleic acid sequence, which ensures the export of the expressed protein, where the 
signal peptide-coding nucleic acid sequence is preferably directly 5' bound to the hetero- 
logous gene to be expressed. For the secretion and modification of many eukaryotic proteins, 
it is necessary to fiise the protein sequence at the N-terminus with a signal sequence, in order 
to steer the polypeptides into the secretion apparatus. Possible for this are for example 
components from the X occidentalis gene GAM I (Dohmen et al., 1990) and from a hormone 
gene of the crab Carcinus maenas (Weydemann et al, 1989), which were successfully used 
for the secretion of hirudin (Weydemann et al., 1995), 

In a &rther preferred embodiment, the nucleic acid molecule also comprises at least a part of 
a vector, in particular control regions, wherein the vector can be selected from: 
bacteriophages such as ^derivatives, plasmids, adenoviruses, vaccinia viruses, baculoviruses, 
S V40 viruses and retroviruses, preferably MoMuLV (Moloney Murine Leukemia Virus). 
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Especially preferred are yeast transformation vectors, both integrative yeast plasmids (Yip) 
and also extrachromosomal plasmid vectors being possibilities. The extrachromosomal 
plasmid vectors subdivide into episomal yeast plasmids (YEp), replicative yeast plasmids 
(YRp) and yeast centromer plasmids (YCp) (see Singh, K.K, and Heinemann, I A., Chapter 
1 1 in Recombinant Gene Expression Protocols, v. supra). Further, artificial yeast chromo- 
somes (YACs) are also possible as expression vectors according to the invention. 

Also especially preferred vectors are yeast replication plasmids, which contain a replication 
origin ori and an antibiotic resistance cassette, so that they can be propagated and selected in 
E, coll. Furthermore, they bear an ARS sequence for chromosomally independent replication 
in the yeast cell, such as for example HARSI from K pofymorphay and a metabolic yeast 
selection marker, such as for example URA3 oxHLEUl (Gellisen and HoUenberg, 1997). 

Many heterologous proteins have already been produced in H, polymorpha (Gellissen and 
HoUenberg, 1997) and by the placement of more than one expression cassette on one trans- 
formation vector, co-expression of different genes is also possible (Gilbert et al., 1994). 

Further, a nucleic acid molecule is preferred which in addition comprises a His-tag coding 
DNA sequence, which on expression of the construct leads to the formation of a fiision 
protein with a His-tag at the NH2 terminus, which facilitates the purification of the protein on 
a nickel column through chelate formation. 

According to the invention, host cells are provided, which contain the nucleic acid molecule 
and which are suitable for the expression of the heterologous gene. In the state of the 
technology, many prokaryotic and eukaryotic expression systems are known, where the host 
cells are for example selected from prokaryotic cells such as £. coli or B. subtilis, from 
eukaryotic cells such as yeast cells, insect cells and mammalian cells, e.g. CHO cells, COS 
cells or HeLa cells, and derivatives thereof In the state of the technology for example, 
certain CHO production lines are known whose glycosylation patterns are altered compared 
to CHO cells. 

Preferably, the yeast belongs to a genus selected from; Pichia, Hamemtla, Candida, 
Tomlopsis, Saccharomyces, Schizosaccharomyces, Kluyveromyces and Yarrowia, In 
particular, the microorganism belongs to a species selected from: Hamemla polymorpha^ 
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Saccharomyces cerevisiae, Schizosaccharomyces pombe, Kluyveromyces lactis and Yarrowia 
lipolytica (see Recombinant Gene Expression Protocols in Methods in Molecular Biology, 
above). 

A further object of the present invention is a process for the production of a polypeptide with 
chorismate mutase activity. For this, the nucleic acid molecule according to the invention is 
expressed in a suitable host cell and the protein isolated from the host cell or the medium by 
standard procedures. 

Many procedures for the expression of DNA sequences are known to the skilled person; see 
Recombinant Gene Expression Protocols in Methods in Molecular Biology, above. The 
expression can be both constitutive and also inducible, inducers such as for example IPTG 
and Zn^"^ being known to the skilled person. The polypeptide with chorismate mutase activity 
produced can, if a His-tag was fiised to the NH2 terminus of the pol3^eptide, be purified by 
chelate formation on a nickel column. Preferably the polypeptide with chorismate mutase 
activity is purified by ion exchange chromatography and/or gel filtration chromatography. 
The implementation of these measures is well-known to the skilled person. 

In a further preferred embodiment, the polypeptide with chorismate mutase activity produced 
according to the invention is modified. Here the modifications include the di-, oligo- and 
polymerisation of the monomeric starting product for example through crosslinking, eg. with 
dicyclohexylcarbodiimide or pegylation or association (self-assembly). Other modifications 
include side-chain modifications, for example of E-amino-lysine residues of the polypeptide, 
or amino- or carboxy-terminal modifications. Other modifications include posttranslational 
events, e.g. the glycosylation or the partial or complete deglycosylation of the protein. 

In a preferred embodiment, the polypeptide obtained in recombinant expression in 
prokaryotes or glycosylation-deficient eukaryotes is non-glycosylated. Also taken into 
account according to the invention is a polypeptide with chorismate mutase activity which is 
glycosylated by recombinant expression in eukaryotes capable of glycosylation such as yeast 
cells, insect cells or mammalian cells, such as CHO cells or HeLa cells. 
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In a fizrther embodiment, polypeptides with chorismate mutase activity are made available, 
which include an amino acid sequence, the amino acid sequence being encoded by one or 
several of the nucleic acid molecules according to the invention. 

Preferably, polypeptides with chorismate mutase activity are made available, which comprise 
the amino acid sequence in SEQ ID N0:2 or a fragment thereof, which has at least 10% of 
the chorismate mutase activity of the chorismate mutase according to SEQ ID N0:2, 
preferably more than 50% and especially preferably more than 75%. 

In a further embodiment, the invention provides polypeptides with chorismate mutase 
activity, obtainable by the recombinant production process or modifications thereof 

Further, non-glycosylated and glycosylated polypeptide with chorismate mutase activity, 
obtainable by expression in host cells capable or incapable of glycosylation, is provided. 
Depending on the intended use of the polypeptide, the glycosylation pattern of yeast, in 
particular methylotrophic yeast such as Hansenula polymorpha, of COS or HeLa cells can be 
preferred. 

The invention further provides antibodies which specifically react with the polypeptide with 
chorismate mutase activity according to the invention and are obtainable by immunisation of 
an experimental animal with the polypeptide. Polyclonal antibodies can be obtained by 
immunisation of for example rabbits, mice or rats and subsequent extraction of antisera. 
Monoclonal antibodies can be obtained by standard procedures by immunisation of e.g. mice, 
extraction and immortalisation of the spleen cells and cloning of the hybridomas which 
produce antibodies specific for the polypeptide. 

In a further embodiment, the invention provides a process for the production of a phenyl- 
alanine and tyrosine-auxotrophic yeast strain, comprising the destruction of the endogenous 
chorismate mutase gene of the corresponding yeast strmn, wherein the mutant displays less 
than 10% of the chorismate mutase activity of the chorismate mutase according to SEQ ID 
N0:2. 

According to the invention, homologous recombination is possible as a procedure for the 
production of the mutant. The disruption of the chorismate mutase gene by homologous 
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recombination is based on the replacement of the endogenous chromosomal copy of the gene 
by an inactivated copy. For the preparation of a disruption construct, the cloning of larger 
regions of the gene to be destroyed, is necessary for the efficient homologous recombination, 
preferably including the 5' and 3' regions. The cloned regions should preferably cover at 
least 2 kb. 

A Bgin digestion of genomic DNA from Hansemla polymorpha and subsequent Southern 
Blot analysis v\^ith a chorismate mutase-specific probe such as e.g. SEQ ED N0:3 gave two 
bands of 3.2 kb and 3.0 kb respectively. Further investigation showed that the 3.2 kg Bgin/ 
Bglll fragment includes 690 bp of SEQ ID N0:3 and contains the flanking 5' region. The 
3.0 bp Bgin fragment includes 960 bp of SEQ ID N0:3 and contains the flanking 3' region. 
The isolation and directed fiision of the 3.2 kb fragment with the 3.0 kb fragment by means of 
standard procedures known to the skilled person leads to a 6.2 kb fragment, which includes 
the chorismate mutase gene and large 5' and 3' flanking regions. 

According to the invention, the nucleic acid to be destroyed, preferably a nucleic acid 
according to SEQ ID NO: 1 or a nucleic acid homologous thereto, especially preferably the 
nucleic acid according to SEQ ED N0:3 or a nucleic acid homologous thereto, in particular 
the nucleic acid including the 6.2 kb genomic DNA fragment, a nucleic acid homologous 
thereto or parts thereof, is cloned and mutated by oligonucleotide exchange, wherein the 
mutation can include additions, deletions, inversions or substitutions, which decrease the 
expression of the gene or lead to an inactive translation product. The mutation preferably 
takes place in at least one region or parts thereof, which are selected from the nucleic acid 
positions corresponding to the amino acid positions (SEQ ID N0:2) 10 to 20, 154 to 167, 192 
to 196 and 240 to 247; these regions supposedly form the catalytic centre of the chorismate 
mutase. It is preferable to inactivate the nucleic acid coding for the chorismate mutase by 
cloning in a longer oligonucleotide. 

Preferably the endogenous chorismate mutase gene is present as a single copy gene. 

Preferably the oligonucleotide used for the oligonucleotide exchange comprises at least one 
selectable marker such as an antibiotic resistance or a metabolic marker. According to the 
invention, all selectable markers known in the state of the technology are included. 
Alternatively, the construct can also be prepared synthetically. For efficient homologous 
recombination, the construct should have a length of at least 2 kb. 
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The cloned-in oligonucleotide is flanked both 5' and 3' by chorismate mutase-specific 
fragments, whose sequences correspond to the sequences stated in claim 1 (a) to 1 (h) or are 
complementary to these and can hybridise with the original chromosomal copy. 

The construct is first linearised for the production of a disruption mutant. Next, the proto- 
trophic yeast strain intended for the production of the disruption mutant is transformed with 
the construct. Since the construct comprises a selectable marker, the transformants are 
selected by appropriate selection pressure. The phenylalanine/tyrosine-auxotrophic 
transformants are identified by growth on phenyIaIanine/tyrosine-fi*ee medium. 

In a preferred embodiment, the process comprises the steps: 

a) Preparation of a construct comprising at least two ifragments of a nucleic acid according to 
claim 1 (a) to 1 (h) suitable for homologous recombination, which flank a nucleic acid 
unsuitable for homologous recombination; 

b) Transformation of cells of a yeast strain with an intact endogenous chorismate mutase 
gene with this construct 

and 

c) Identification of the phenylalanine and tyrosine-auxotrophic transformants. 

Especially preferably, the construct also comprises a selection marker gene. Further the 
construct can comprise one or several recombination sites, which preferably 5' and 3' flank 
the selectable marker and make possible the excision of the selectable marker fi-om the 
construct after disruption of the endogenous chorismate mutase gene has been effected. 

Preferably the recombination site is loxP. In an especially preferred embodiment, the process 
further includes in step b) that the cells of the yeast cells in step b) are brought into contact 
with nucleic acid suitable for the expression of Cre-recombinase, as a result of which the 
excision of the selectable marker by means of the loxP/Cre-recombinase system is made 
possible. 
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In an especially preferred embodiment, the yeast strain is Hansemla polymorpha. 

In a further embodiment, the invention provides a phenylalanine and tyrosine-auxotrophic 
yeast strain, such as is obtainable e.g. by the process according to the invention. 

In a fiirther embodiment, a process for the recombinant production of proteins is provided, 
which includes: 

a) Transformation of an auxotrophic yeast strain according to the invention with a 
combination of a nucleic acid according to the invention suitable for expression and a 
heterologous gene suitable for expression under the control of a suitable promoter; 

b) Culturing of the transformants under conditions suitable for the expression of the 
heterologous gene and of the nucleic acid molecule according to the invention and if 
necessary isolation of the protein which is encoded by the heterologous gene. 

In a preferred embodiment, the process further includes the step of the selection of the 
transformants on a phenylalanine and/or tyrosine deficient medium. 

The recombinant production process according to the invention can be performed with 
nucleic acid molecule according to the invention and heterologous gene spatially separate 
from one another, where the two can be provided in two vectors (binary vector system) and 
can be transformed independently of one another. Alternatively, the nucleic acid molecule 
according to the invention, which codes for a polypeptide with chorismate mutase activity, 
and the heterologous gene are present in one vector. These cointegrate vectors have the 
advantage that the selectable marker gene and the heterologous gene are covalently bound to 
one another, so that all prototrophic clones after transformation also contain the heterologous 
gene, as a result of which the effort of screening is reduced. 

Promoters and vectors preferred according to the invention for the expression of the 
heterologous gene correspond to the promoters and vectors stated above for the expression of 
the nucleic acid molecule according to the invention. The vector optionally also contains a 
signal peptide-coding nucleic acid sequence, which causes the secretion of the recombinant 
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protein into the medium, and which is directly 5' bound to the heterologous gene. Also 
advantageous is the addition of a His-tag coding nucleic acid sequence to the 5' end of the 
heterologous gene, which after recombinant expression makes purification on a nickel chelate 
column possible. 

The invention further provides recombinant protems which are obtainable by the process 
stated above. 

It is intended to illustrate the invention with the following figures and examples, but in no 
way to limit it On the basis of the description and the examples, other embodiments are 
accessible to the skilled person, which are also included. 

Figure 1 shows the biosynthesis of the aromatic amino acids via the shikimate pathway. 

Figure 2 shows the ligation of the DNA fi-agments into the vector pRS426, The vector was 
linearised with the restriction enzyme BamHI and the genomic Sau3 A DNA fiagments from 
H, polymorpha were ligated into this cleavage site. This is possible, since the cleavage sites 
BamHI and Sau3 A are compatible. 

Figure 3 shows the growth comparison of cells transformed with 2 complementing DNA 
fragments. To highlight the different growth rates, cell material of the cells complemented 
with a 5 kb (pME1524) or a 1.8 kb (pME1525 subclone of pME1524) DNA fragment was 
diluted in water and 20 of each dilution stage was applied onto minimal medium. The 
optical density of the different dilution stages is the same in each case, however since only 
the fundamentally different growth rates had to be illustrated, no accurate growth parameters 
were determined. 

Figure 4 shows the autoradiograph of a hybridisation of chromosomal DNA from different 
fungi with a 1.8 kb DNA fragment from H. polymorpha, which is capable of complementing 
the phenylalanine/tyrosine auxotrophy of the 51 cerevisiae aro7A selection strain. Chromo- 
somal DNA from S, cerevisiae (1), A, nidulans (2) and K polymorpha (3) was cleaved with 
the restriction enzyme EcoRV and hybridised with a ^^P radioactively labelled DNA probe, 
prepared using the genomic 1 .8 kb DNA fragment from H. polymorpha. 
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Figure 5 shows the sequence of a genomic 1.8 kb fragment (SEQ ID N0:3) which is capable 
of complementing the phenylalanine/tyrosine auxotrophy of the & cerevisiae aro7A selection 
stmn. The 5' and 3' region are shown in small letters, the open reading frame (SEQ ID 
N0:1) and the primary sequence (SEQ ID N0:2) of the derived gene product in large letters. 

Figure 6 shows the amplification of the flanking regions of the HAR07-0RF. The flanking 
regions of the HAR07 gene were amplified with the primers OLSK34/T7 and OLSK35/T3 
respectively and then cleaved with Apal/Bgin and Bgin/Xbal respectively. 

This is possible, since the primers OLSK34 and OLSK35 contain Bglll recognition 
sequences. 

Figure 7 shows the production of various deletion constructs. The flanking regions of the 
HAR07-0RF were ligated into the Apal/Xbal cleaved vector pBluescriptKSII as Apal/Bglll 
and Bgin/Xbal fragments respectively. The vector pBluescript U KS is marketed by the firm 
Stratagene, USA. The vector was again opened with Bglll and the disruption cassettes 
cleaved with BamHI and Bglll respectively were ligated in. 
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Material and Methods 

1. Material 
1.1 Chemicals 

Chemicals for the preparation of solutions, buffers and media were obtdned from the firms 
Merck (Darmstadt, Germany), Boehringer Ingelheim Bioproducts (Heidelberg, Germany), 
Carl Roth GmbH & Co KG (Karlsruhe, Germany), Gibco BRL (Life Technologies Gmbli 
Karlsruhe, Germany), Fluka (Neu-Uhn, Germany) and Sigma-Aldrich Chemie GmbH 
(Steinheim, Germany). 

Restriction en2ymes, DNA-modifying enzymes and polymerases were purchased fi*om MBI 
Fermentas (Vilnius, Lithuania) and RNase A firom Boehringer Mannheim GmbH (Mannheim, 
Germany). As the DNA size standard the "1 kb DNA-ladder" (Gene Ruler Plus) from MBI 
Fermentas was used. Agarose was obtained from Roth. 

For the preparation of plasmid DNA from Escherichia coli and for the extraction of DNA 
from agarose gels. Kits from the firm Qiagen (Hilden, Germany) were used. 
Synthetic oligonucleotides were purchased from Nucleic Acid Products Supply Gottingen 
GmbH (Gottingen, Germany) and Gibco BRL (Life Technologies GmbH, Karlsruhe, 
Germany). 

Contract sequencing was performed by the firm MWG-Biotech GmbH (Ebersberg, 
Germany). 

L2 Strains, Plasmids and Oligonucleotides 

For cloning, the£. coli strain DH5a [F',<p80dlacZAM15,A(lacZY A-argF),U169,deoR,recAl, 
endAl,hsdR17,(rK-,mK'),supE44,X-,thi-l,gyrA96,relAl] was used (Woodcock, 1989). For 
this work, the H. pofymorpha strain RB 1 l(ura3) was used (Weydemann et aL, 1995). 

The 5. cerevisiae strains used are set out in table I . 

The plasmids and oligonucleotides needed for this work are listed in tables 2 and 3. 
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Strain 


GenoQ'pe 


Reference 


RH2185 


MAToukiro7::LEU2 suc2-69 ura3-52 leu2- 
3leu2-112his4-519 


Schnappauf et al., 1997 


RH1405 


MATasuc2-59 ura3-52 Ieu2-3 Ieu2-1 12 

his4-5I9 


Schnappaufetal., 1997 


AaDiel: &>. cerevisiae strains used ■ ' 


FJasnud 


Description 


Reference 


pRS426 ~ 


J /zo op snuttie vector, bla URA3 


SikorskiandHSeter, 
1989 


pBIuescript KS II 


2961 bp vector, bh, lacZ, Multiple Cloning 
Site (MCS) 


Stratagene 


PME1513 


p42mET25 with altered MCS {Sad 
pMET25 Xbal Spel Bamm Sail Sfil Noil 
XhofCYCIl^T) 


Probst, 1998 


PME1524 


5 kb &w3 A Fragment from H. p. in Bamm 
cleaved pRS426 




PME1525 


1.8 kb [^/wfl//r//&im]-fragment from pME 
I524inpRS426(SmaI) 




PME1526 

Table 2: Plasmids as 


pBIuescnpt H KS (ApaVHindSr) + 1.8 kb 
(ApaJJHindniyQragmetA from pMEl 524 | 
ied 





Oligonucleotide 


Size 


Sequence 


T3 


20-mer 


5'-(AATTAACCCTCACTAAAGGG>3' 


T7 


22-mer 


5'-(GTAATACGACTCACTATAGGGC)-3' 


OLSK34 


26-mer 


5'-(ATATAGATCTACAAA_AACrAAACAGG)-3' 


OLSK35 

Table 3: Olieonude 


28-mer 


5'-(ATATAGATCTGATGCG-ACGCAGAAAAGC)-3' 
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2, Methods 

2.1 Cultariog of the Microorganisms 
2Ad Escherichia coli 

The cells were cultured in Luria-Bertani medium (LB: 1% tryptone, 0.5% yeast extract, 1% 
NaCl) at For strains with the ampicillin resistance marker, 50 mg/1 ampicillin were 
added to the medium. 

2.1«2 Hansenulapotymorpha 

The ceils were cultured at 37^C either in "yeast extract peptone dextrose medium" (YEPD: 
2% peptone, 1% yeast extract, 2% ^ucose) or in "yeast-nitrogen-base medium" (YNB: 
0.15% yeast nitrogen base, 0.5% (NH4)2S04, 0.1% myo-inositol (200 mM), 5% glucose, 
supplemented with uracil for the present strain). All media were autoclaved before use and 
for solid media 2% agar was added. 

2A3 Saccharomyces cerevisiae 

The cells were cultured at SO'^C either in "yeast extract peptone dextrose medium" (YEPD: 
2% peptone, 1% yeast extract, 2% glucose) or in "minimal vitamins medium" (MV: 0.15% 
yeast nitrogen base, 0.52% ammonium sulphate, 2% glucose, 1% succinate, 0.3% KOH). 
Supplements such as L-amino acids or uracil were added after Sherman et al. (1986) and, like 
the antibiotics, sterile filtered or autoclaved and added to the sterile medium. For solid 
media, 2% agar was added. The growth of the yeast cells was followed by measurement of 
the optical density at 600 nm. 

2.2 Isolation of Nucleic Acids 

2*2 J Qiagen Plasmid DNA Preparation from Escherichia coli 

Firstly the cultured bacteria were centrifiiged off and the sediment formed was resuspended 
in 0.3 ml of buffer PI (buffer description as per manufacturer). Then 0.3 ml of buffer P2 
were added and the mixture incubated for 5 mins at room temperature. In the next step, 
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0.3 mi of the cooled buffer P3 were added and again incubated for 5 mins, this time on ice. 
Next, the mixture was centrifuged for 10 mins (12,000 rpm) and the supernatant added to a 
Quiagen-tip 20 column previously equilibrated with buffer QBT. After the supernatant had 
passed through the column, this was washed four times with 1 ml portions of buffer QC. The 
bound DNA was eluted with 0.8 ml of buffer QF. The eluted DNA was precipitated with 0.7 
volumes of isopropanol and centrifuged oflf (30 mui, 10,000 rpm). Finally the DNA was 
washed with 1 ml of 70% ethanol, dried and taken up in H2O. 

2.2.2 Isolation of Plasmid DNA from Escherichia coli (Birnboim and Doly, 1979) 

The £. coli cultures were cultured overnight in 5 ml of LB-Amp at 3TC on a rotary shaker. 
1 .5 ml of the culture were centrifuged off in an Eppendorf reaction vessel and the cells 
resuspended in 100 ^1 of solution I (50 mM glucose, 10 mM EDTA, 20 mM Tris-HCl, pH 
8.4, 4 mg/ml lysozyme). After 7 mins incubation at RT, 200 \i\ of solution 11 (0.2 mM 
NaOH, 1% w/v SDS) were added, the mixture gently shaken and incubated for 5 mins on ice. 
50 |il of solution UI (3 M Na acetate, pH 4.8) were added, the mixture again gently shaken 
and incubated a fiirther 7 mins on ice. The cell debris was centrifuged off and the super- 
natant extracted with 450 fil of CH2Cl2-saturated phenol in TE buffer (10 mM Tris-HCl, 
1 mM EDTA, pH 8.0) and then with 450 jil of CH2CI2. The plasmid was precipitated by 
addition of 1 ml of cold EtOH and incubation at -20T. After 30 mins centrifiigation at 4^C, 
the precipitate was washed with 200 \i\ of 70% EtOH, dried in vacuo and taken up in 50 ^1 
of TE buffer (incl. 25 fig/ml RNase A). To dissolve the DNA this was heated for 5 mins 
at 65X, and RNA degradation was effected using the heat-stable RNase A by incubation for 
20 mins at ZTC. The plasmid DNA solution was stored at -20^C, 

2.2.3 Isolation of Chromosomal DNA from Hamemla polymorpha 
(Hofman and Winston, 1987) 

10 ml of YEPD medium were inoculated and incubated for about 15 hrs at 37°C. The cells 
were centrifuged off, resuspended in 0.5 ml of distilled water and again centrifuged off The 
supernatant was discarded and 0.2 ml of lysis buffer (2% Triton X-100, 1% SDS, 100 mM 
NaCl, 10 mM Tris, pH 8.0, 1 mM EDTA), 0.2 ml of phenol/MeCbAE and 0.3 g of glass 
spheres were added. To disintegrate the cells this was shaken for 3-4 mins and then the cell 
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debris was centrifuged off (5 min, 13,000 rpm). The aqueous phase was placed in a new 
container and after addition of 1 ml of ethanol the DNA was precipitated and again 
centrifuged ofE The sediment was resuspended in 0,4 ml of TE (10 mM Tris-HCl, 1 mM 
EDTA, pH 8.0) and 3 ^1 of RNase A (10 mg/ml) were added, whereupon the mixture was 
incubated for 5 mins at 'iTC, Now 10 jil of 4M ammonium acetate and 1 ml of ethanol were 
added, the precipitated DNA was centrifuged off, the supernatant discarded, dried and taken 
up in 50 vX TE [sic]. 

2.3 Cloning Techniques 

2.3,1 Polymerase Chain Reaction CPCR'; Saild et aL, 1985) 

Polymerase chain reactions were performed with the heat-stable enzyme Taq-polymerase 
(Fermentas) in GeneAmpTM reaction vessel (Eppendorf). Usually each time 5-50 nmol of 
primer oligonucleotide and 10-100 ng of DNA as matrix were used in 20-50 ^1 of reaction 
buffer in accordance with the manufacturer's instructions. As a rule, 30 cycles of the 
following temperature regime were performed: 30 sees denaturation at 94''C / 30 sees 
hybridisation at specific reaction temperature / 30 sees - 3 mins synthesis at ITC for Taq- 
DNA polymerase. The PCR cycles were initiated by a 3 minute denaturation step and 
finished by a final synthesis step of 5 mins. 

2.3-2 BNA Restriction 

For analytical restriction reactions, ca. 0.5 ^g of DNA were incubated with 1-2 units of 
restriction enzyme in a volume of 20 |xl for 2-3 hrs at 37^0. For the restriction of preparative 
quantities of DNA, correspondingly larger volumes and quantities of enzyme were used. 
Reaction buffers were used in accordance with the manufacturer's instructions. 

2.3.3 Agarose Gel Electrophoresis 

0. 1 vol. of DNA dye (25% w/v FicoU 400, 0.25% w/v Bromphenol blue, 0,25% w/v 
xylenecyanol, 200 mM EDTA, pH 8.0) were added to the restriction mixture, and the DNA 
fragments were separated electrophoretically in a horizontal agarose gel in TAE buffer 
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(40 mM Tris acetate, 20 mM NaOAc, 2 mM EDTA, pH 8.3) in the presence of 0.5 ^ig/ml 
ethidium bromide. DNA bands were detected with a UV Transilluminator (254 mm). 

2.3.4 Isolation of DNA Fragments from Agarose 

To isolate DNA fragments from the agarose gel, the columns and buffers of the QIAquick 
Gel Extraction Kit Protocol were used in accordance with the manufacturer's use directions. 
For this, the DNA fragments were firstly cut out from the gel and weighed. 3 volumes of the 
buffer QXl were added, and incubated for 10 mins at SO^'C. As soon as the gel fragment had 
completely dissolved, 1 volume of isopropanol was added and the mixture was placed on the 
QIAquick column and centrifiiged off. To wash the DNA bound onto the column, 0.75 ml of 
PE buffer were added and again centri&ged. Finally, the DNA was eluted with 50 \xl of H2O. 

23.5 Ligation of DNA Fragments (Maniatis et ai, 1989) 

Linear DNA fragments with sticky or smooth ends were ligated in a reaction mixture in a 
total volume of 20 |al with ligation buffer (20 mM Tris-HCl, 10 mM MgCh, 10 mM DTT, 
0.6 mM ATP, pH 7.6) and 5 units of T4-DNA ligase overnight at 15^C or for 5 hrs at room 
temperature. The concentration of DNA was between 1 and 10 M.g/ml, and the vector/insert 
DNA mole ratio between 1 :5 and 1:10. Following the ligation reaction, the DNA was used 
for transformation without further purification. 

2.4 Transformation Methods 

2AA Transformation of £ coU (Inoue et aL, 1990) 

Cells of a 250 ml SOB culture (2% tryptone, 0,5% yeast extract, 10 mM NaCl, 2.5 mM KCI, 
10 mM MgCla, 10 mM MgS04) in the exponential phase were centrifiiged off for 10 mins at 
4X (lOOOxg) and resuspended in 80 ml of TB buffer (10 mM PIPES, 15 mM CaCkxHaO, 
250 mM KCI, pH 6.7, 55 mM MnCl2xH20). After incubation (10 mins on ice), the cells were 
again centrifiiged off at 4^C and 2,500 rpm. Then the sediment was carefiilly resuspended in 
20 ml of TB buffer and also carefully mixed thoroughly with 7% DMSO. After 10 min 
incubation on ice, the now competent cells were aliquotted, shock-frozen in hquid N2 and 
stored at -70''C. 200 ii\ of competent cells were added to 1-10 jig of DNA and incubated on 
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ice for 30 mins. After 30 sees heat shock at 42°C, the mixture was chilled on ice, treated 
with 800 iLil of SOC medium (SOB, incl 20 mM glucose) and shaken for 1 hr at 37°C. 
Various amounts (10-900 \xl) of the cell culture were plated out on solid selective medium 
(agar plates with LB complete medium, supplemented with 50 ^g/ml ampicillin) and finally 
the plates incubated overnight at 37**C to give transformed cells time for colony formation. 

2A2 SL cerevmae Transformation (modified after Ito et aL, 1983) 

0.5 ml of a fresh yeast culture, grown overnight, were used per transformation mix. The cells 
were centrifliged off at room temperature (5 mins, 3,500 rpm) and the supernatant was 
removed. Now the transforming DNA and 50 \ig of carrier DNA were added and mixed. 
Finally, 0,5 ml of PEG (40% PEG3350, 0,lM LiOAc, 10 mM Tris pH 7,5, 1 mM EDTA, 
0. 1 M DTT) were added, and the transformation mixes were shaken and incubated for at least 
12 hrs at room temperature. After a 20 min heat shock (42°C), the cells were carefully 
centrifiiged, and incubated in 1 ml YEPD for 1 hr at 30''C. Finally the cells were centrifiiged 
for 5 sees in the bench centrifuge, resuspended in the remaining liquid after discarding of the 
supernatant and plated out on selection media. 

2.5 Hybridisation Techniques 

2*5 J Southern Hybridisation (Southern, 1975) 

For the Southern hybridisation, ca. 10 jag of chromosomal DNA were dissolved in TE buffer 
(10 mM Tris-HCl, 1 mM EDTA, pH 8.5) and subjected to restriction digestion for 12 hrs 
with a suitable restriction enayme. The cleaved DNA was separated in a 1% agarose gel. 
The agarose gel was washed 3 times, for 20 mins each time, firstly in 0.25M HCl, then in 5M 
NaOH/I .OM NaCl and finally in IM NHtOAc. Now the separated DNA was transferred by 
12-hour dry-blotting onto a nitrocellulose membrane. This was next washed for 2 mins in 
2 X SSC (0. IM NaCl, 10 mM NaOAc, pH 7.0), dried in air, and finally "crosslinking" was 
performed for 5 mins under UV light (254 nm). 

The membrane was prehybridised for 30 mins at eS^'C with 50 ml of Church buffer (7% SDS, 
1% BSA, 1 mM EDTA, 250 mM Na phosphate, pH 7,2), after which half of the hybridisation 
solution was discarded. Now the radioactively-labelled DNA probe was added and the 
hybridisation was performed overnight at 65°C. Following the hybridisation, the membrane 
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was washed for 30 mins at 65**C with 2 x SSC/0.1% SDS and then exposed on an Xray film 
or on a phosphor imaging plate. 

2*5.2 Preparation of the Probe DNA 

For the preparation of the probe DNA, the Prime-It (Random Primer Labeling Kit) from the 
firm Stratagene was used in accordance with the manufacturer's instructions. The DNA 
required was previously amplified by PCR and then used for the probe preparation. 

2.6 Characterisation of the Chorismate Miitase from Hansenula polymorpha 
2.6*1 Crude Extract Preparation 

The yeast cells were cultured in YNB medium, individually supplemented for the particular 
yeast strains, and harvested at an OD546 of ca. 4. Crude extracts were prepared as described 
by Kradolfer et al. (1977), using an Amicon French Press. 

2*6*2 Determination of Protein Content 

The protein content of the protein solutions was determined by the method of Bradford 
(1 976) with BS A as protein standard, 

2.63 Measurement of Enzyme Activity 

For the measurement of the chorismate mutase activity, stop tests, such as those developed by 
Schmidheini et al. (1989), with a few modifications were performed. All enzyme tests and 
extinction measurements were performed at The 500 ^il reaction mixes contained 
50 mM potassium phosphate buffer, pH 7.6, 2 mM EDTA, 20 mM DTT and 10 [il of the 
protein fraction concerned. The reaction, i.e. the conversion of chorismate to prephenate, was 
started by addition of chorismate at a final concentration of 1 mM and stopped after 10 mins 
with 500 |il of IM HCl, which also effects the conversion of prephenate into phenylpynivate 
which is measurable at OD320. 



27 



Neutralisation by addition of 4 ml of IM NaOH ended the reaction. The measurement of the 
OD320 was performed against H2O, and similar solutions without enzyme were used as null 
values. The measurements were also performed in the presence of 5 pM L-tryptophan. The 
null values of the measurements without enzyme were subtracted from the absorption values. 
With the molecular extinction coeflBcient of phenylpymvate at 30^C of 13095 nl/(mol x cm), 
the conversion rate (jimol product/(min x mg protein)) and hence the specific activity 
(conversion/mg protein) could be calculated. 

Results 

1. Preparation of a Genomic Gene Bank from Hansenula polymorpha 

For the preparation of a genomic gene bank, chromosomal DN A from Hansenula poly- 
morpha RBll was isolated and this was subjected to partial restriction with the enzyme 
Sau3 A. The recognition sequence of this enzyme is relatively frequently present in the 
genome, since it consists of only 4 bases. During the 5-hour reaction time, aliquots were 
taken at 15 min intervals, and the reaction in each stopped by addition of EDTA. In this way, 
it was ensured that the chromosomal DNA was cleaved into fragments of different size. The 
cleaved DNA was separated in an agarose gel, divided into different size fractions between 
1 kb and 10 kb, and correspondingly extracted from the gel. The extracted fragment fractions 
were ligated into the vector pRS426 (Sikorski and Hieter, 1989), previously linearised with 
BamHI (figure 2). 

The plasmid library thus obtained was transformed into E coli, during which the antibiotic 
resistance encoded by the bla-gene was selected for in the presence of ampicillin. Overall, 
about 1 50,000 transformants were obtained. These were washed from the plates with LB 
medium and stored at -20^C after addition of glycerine. 

2. A GeEomic l.S kb BNA Fragment from H polymorpha Complements the aroJA- 
Phenotype in S, ceremiae 

The plasmids of the genomic H. polymorpha bank were isolated from E. coli and transformed 
into the S. cerevisiae strain RH2185 (MATa(Aaro7::LEU2 suc2-59 ura3-52 leu2-3 leu2-l 12 
his4-519), in order to complement its Tyr/Phe auxotrophy, which is attributable to a deletion 
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in the AR07 gene. The transformed yeast cells were firstly selected for uracil prototrophy 
and thus for the presence of the URA3 gene of the vector pRS426 (SC~Ura) and then 
transferred to minimal medium (YNB + Trp + His). After about 5 incubation stages at 30°C, 
3 transformed yeast colonies were isolated which were capable of growing in the absence of 
uracil, tyrosine and phenylalanine. The plasmids were isolated and even after a retransform- 
ation were capable of complementing the AR07 deletion in RH2185. By restriction analysis, 
an approximately 5 kb additional DNA fragment was identified in the vector pRS426. For 
fixrther localisation of the DNA fi^agment, a subcloning was performed wherein the fragments 
obtained were again ligated into the vector pRS426. These different plasmids were now, 
analogously to the plasmids of the genomic bank, examined for their ability to complement 
the Tyr/Phe auxotrophy in REE 1 85. An approximately 1 . 8 kb Apal/Xbal fi-agment was thus 
obtained, which was capable of doing this. This finding was confirmed by repeated retrans- 
formation. It was striking that the 1.8 kb H, polymorpha DNA fragment (in the plasmid 
pME1525) in transformed yeast cells showed growth comparable with the wild type, while 
the 5 kb K polymorpha fragment (in the plasmid pME1524) in transformed yeast cells 
resulted in markedly slower growth (figure 3). 

In order to confirm that the complementing 1.8 kb fragment originated icom Hamenula 
polymorpha, this was used as the probe in a Southern hybridisation. The chromosomal DNA 
from K polymorpha, A. nidulans and 5. cerevisiae was cleaved with the restriction en2yme 
EcoRV and after hybridisation with the probe a clear signal could be seen with H. poly- 
morpha, a weak signal with 5. cerevisiae and no signal with A. nidulans. Furthermore, it 
could be shown by the Southern hybridisation that the gene occurs only once in the genome 
of K polymorpha (figure 4). 

The 1 .8 kb DNA fragment from K polymorpha RBI 1 was sequenced completely and an 
open reading frame consisting of 843 bp could be identified (figure 5), which showed a 58% 
agreement with the AR07 gene from S. cerevisiae. It can thus be assumed that this is the 
gene of the yeast Hansenula polymorpha homologous to AR07, which is referred to below as 
HAR07. 
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3* Preparation of Various Deletion Constructs 

On the basis of the sequence of the 1 .8 kb DNA fragment from H. pofymorpha^ the primers 
OLSK34 and OLSK35 were used for the preparation of various haro7A-deletion constructs. 
The various deletion constructs were transformed into H, polymorpha, in order to confirm the 
identity of the gene and to confirm that that no isoenzymes of chorismate mutase exist in H. 
polymorpha. 

By deletion of HAR07 in Hcmsemla polymorpha, the identity of this gene can be conclus- 
ively proved and the existence of alternative genes checked. At the same time, the construct- 
ion of such a deletion mutant opens up new selection possibilities. For the preparation of a 
haro7A deletion strain, three different disruption cassettes were constructed. For the 
necessary integration into the Hansemla polymorpha genome, the flanking re^ons of the 
HAR07 gene, which should effect the deletion of the HAR07 gene by homologous 
recombination, were used for all three constructs. The flanking region in the 3' direction was 
amplified using the primers OLSK34 and T7, and next cleaved with the restriction enzymes 
Apal and Bglll. The flanking region in the 5' direction was amplified using the primers 
OLSK35 and T3, and cleaved with the restriction enzymes Xbal and Bgin. This is possible 
as the primers OLSK34 and OLSK35 contain a Bgill cleavage site (figure 6). 

Both fragments were ligated into the Apal/Xbal cleaved vector pBluescript KSII and then the 
vector was again opened with Bglll, in order to introduce various disruption cassettes 
(figure 7). 

The hisG::URA3::hisG disruption cassette (Schneider et al., 1996) was cleaved with Bglll 
and ligated into the vector described above. By means of the URA3 marker, integration of 
the cassette into the genome of i/. polymorpha can be checked. Whether a homologous 
integration, i.e. a deletion of HAR07, has in fact taken place can be shown by selection for 
phenylalanine/tyrosine auxotrophy and by Southern hybridisation. The hisG components 
favour the conclusive excision of the URA3 marker, which can be checked by counter- 
selection with 5-FOA (5-fluoroorotic acid), since the URA3 gene product causes a toxic 
conversion of 5-FOA so that only cells without the URA3 marker are capable of surviving 
(Boekeetal., 1984). 
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The loxP::kanMX::IoxP disruption cassette (Guldener et al., 1996) was cleaved with BamHI 
and ligated into the vector described above. By means of the kanMX marker, integration of 
the cassette into the genome of Hansemla polymorpha can be checked by selection for 
kanamycin resistance (G418). The kan*" gene is flanked by the translation elongation fector 
(TEF) promoter and terminator (Steiner and Phiiippsen, 1994) of the filamentous fiingus 
Ashbya gossypii. Whether a homologous integration has in fact taken place can be shown by 
selection for phenylalanine/tyrosine auxotrophy and by Southern hybridisation. In this 
system, the possibility exists of removing the marker again from the genome after the 
deletion has taken place. This occurs through the Cre-loxP recombination system of the 
bacteriophage PI (Guldener et aL, 1996). The plasmid pSH47 (Austin et al., 1981) contains 
the Cre recombinase gene with a GALl promoter connected upstream and in addition the 
components ARS, CEN and a URA3 marker. Galactose induces the expression of the 
recombinase and this elffects the removal of the marker from the genome by recombination of 
the flanking loxP regions. 

The loxP::ODCl::loxP disruption cassette is analogous to the IoxP::kanMX::IoxP disruption 
cassette (Guldener et al., 1996), except that the kanMX gene has been replaced by the ODCl 
gene (=URA3) from Hansenula polymorpha (M Kller, personal communication). In this 
case also, the possibility exists of removing the marker again from the genome after the 
deletion has taken place. The removal of the marker can in this case be checked by selection 
on 5-fluoroorotic acid (5-FOA) medium, since the ODCl gene product causes atoxic 
conversion of S-FOA so that only cells without the ODCl marker are capable of surviving 
(Boekeetal, 1984). 

The transformation of the hisG::URA3::hisG disruption cassette into H. polymorpha KBl I 
resulted in a mutant which had a tyrosine/phenylalanine auxotrophy and a uracil prototrophy, 
and thus possessed the phenotype to be expected. 

4, Esasfiple of RecombinaEt Expression of Proteins Using Auxotrophic Yeast Strains 
and Functional Complementation for Plasmid Selection: 

For the expression of a recombinant protein in K polymorpha using a nucleic acid molecule 
according to the invention, suitable expression plasmids can be prepared. For example, the 
plasmid pFPMTi2l from the firm Rhein Biotech GmbH serves as the basis for this. This 
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, * ;n w nrt/vmom/w after transformation 
™ » o fi.^«, ? rprevisiae for selection in H. poiymuryn^ a* 
bearstheURA3genefrom5.cerew5/aei iggsUodcl oruraS). 

^t,oc*.o RBlKWevdemannetal , lyy^Hoo^i"'" 
nfiiracil-auxotrophic strains such as e.g. KB luwcy 

^Lpe»ee(Q«e*ee.a.. ,994, serves as *e «P«ss.on .o<^.e. 

T o s VI. DNA fragment can be isolated from 
Aft«r digestion with the restriction enzyme Ndel, a 5 kb DNA tragme 

w«h1e marker gene the chorismate mutase gene (HAR07) irom a / 
as a selectable marker gene X Th*> election is performed for prototrophy 

which codes for the enzyme chorismate mutase. The selection pe 
with respect to tyrosine and phenylalanine. 

, • 1 thel3kbEc.72y55^IfragmentcanbeclonedintotheplasmidpFPMTl2l 
Alternatively,thel.^kbi^co/^iy y a nlasmid which bears two 

UnearisedwithS^nal. This procedure results m an expression plasmid w 

marker genes, tm3 and a4i?07. 

As an example of suchacontrol sequence. theMFai genetro 
1998) may be mentioned. 

i„ „Hich *e e.pressio„ n,odu,e ,s P^"^ -^^^^^^^ 

preferably rates place after cuta™ ^ «,ese con^iHons. .here is derepression 

,o.ediumwi.h glycerine asrheonlyCsource. ""^ R^„,,.hU 
of FMD promoter, linked with strong expression of the heterolog 
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can be isolated in pure fonn from the purified culture supernatant by standard chromato- 
graphic methods. 

5« The 1.8 kb Fragment from Hansenula pofymorpha Codes for a Protein with 
Chorismate Matase Activity 

To check whether the L8 kb fragment contains a gene which codes for a protein with 
chorismate mutase activity, cell extracts of different plasmid-bearing yeast strains w^e 
prepared using a French Press and their chorismate mutase activities were measured. For the 
measurement of the specific enzyme activity, cell extracts from 51 cerevisiae RH2185 
(pRS426), K pofymorpha RBI 1, & cerevisiae RH2185(pME1524) and SI cerevisiae 
RH2l85(pME1525) were prepared. It could be shown that both the 5 kb fragment 
(pME1524) and also the 1 .8 Id> fragment (pME1525) from H. pofymorpha code for genes for 
a chorismate mutase activity (table 4), 



<■ Specific Ac(Hit^:(U/nig)'fi^,,. 


f% Spedfic Act 

• ■■•'■-^ai 

liS'---'-' " (+,tryp< 




S. cerevisiae- 


0 


S. cerevisiae- 


0 


RH2185 + pRS426 




RH2I85 + pRS426 




H, polymorpha- 


0.379 


H. polymorpha- 


0.349 


RBll 




RBll 




S. cerevisiae- 


0.191 


S. cerevisiae- 


0.127 


RH2I85 + pME1524 




RH2185 + pME1524 




51 cerevisiae- 


0.469 


S. cerevisiae- 


0.263 


RH2185 + pME1525 




RH2185 + pME1525 





Tab,4: Specific chorismate mutase activities (U/mg protein) of various cell extracts* 
The table shows the mean value of 4 activity measurements in each case. The values for cell 
extracts from S. cerevisiae RH2185+pRS426, S. c.+pME1524, 5, c + pME1525 mdH, pofy- 
morpha KB 1 1 were measured. The measurements were in each case performed with and 
without 500 fxM tryptophan. 
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6. Identification of the HAR07 Gene of the Yeast Hansemla polymorpha 

The complementing 1,8 kb DNA fragment was completely sequenced using a T3 and a T7 
primer (figure 5), 

The en2yme chorismate mutase is encoded in the yeast Hansemla polymorpha by a gene 
which consists of 843 bp and has been named HAR07. This gene was cloned by 
complementing the phenylalanine/tyrosine auxotrophy of a Sacchctromyces cerevisiae aroT A 
deletion strain with a genomic gene bank from Hansemla polymorpha. Hence the H 
polymorpha promoter is functional in X cerevisiae. No introns are present, and a 32 kDA 
protein consisting of 280 amino acids can be derived from the sequence. The amino acid 
sequence of the chorismate mutase from Hansemla polymorpha shows 58% identity to that 
from Saccharomyces cerevisiae and 44% to that from Aspergillus nidukms, A striking 
feature in the comparison of these amino acid sequences is an endpiece consisting of 23 
amino acids in the sequence from H. polymorpha^ which is present neither in S. cerevisiae 
nor mA, nidulans, nor in other eukaryotic chorismate mutase amino acid sequences. 
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